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PREFACE 


This  is  the  second  part  of  a two-part  report  that  describes  re- 
search Initiated  at  Rand  in  mid-1975  under  the  Project  RAND  research 
project  "Target  Acquisition."  The  subject  is  "map  matching"  or  image 
correlation  to  achieve  autonomous  target  acquisition  and  terminal 
guidance  for  missiles  (both  strategic  and  tactical) , with  particular 

emphasis  on  the  acquisition  phase. 

* 

Part  I presents  an  analysis  of  the  probabilities  of  correct  and 
false  acquisitions,  extends  it  to  include  the  effeits  of  a number  of 
common  error  sources,  and  describes  computer  simulations  based  on  data 
samples  from  real  scenes.  Part  II  provides  a more  general  and  more 
rigorous  analytical  approach.  Some  of  the  conclusions  derive  jointly 
from  both  phases  of  the  study,  but  Part  II  is  published  separately  be- 
cause it  is  addressed  to  readers  with  a theoretical  and  mathematical 
interest  in  the  subject. 

Both  reports  should  be  of  interest  to  defense  and  Industrial  proj- 
ect managers  and  engineers  involved  in  the  development  of  missile  guid- 
ance, particularly  those  concerned  with  current  or  future  correlator 
programs , 
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SUMMARY 


Image  correlation  or  "map  matching"  makes  possible  a type  of 
weapon  guidance  that  provides  autonomous  target  acquisition  and  track- 
ing. This  study  analyzes  the  image  correlation  process,  both  theoret- 
ically and  by  using  computerized  simulations;  primary  emphasis  is  on 
the  often  neglected  but  crucial  acquisition  phase.  (The  requirement 
for  achieving  adequate  terminal  tracking  accuracy  in  weapon  delivery 
has  been  and  is  being  studied  extensively  elsewhere  and,  for  this  re- 
port, is  considered  to  be  of  secondary  importance;  it  is  simply  assumed 
that,  if  necessary,  operational  systems  could  accommodate  a software 
change  to  maximize  tracking  accuracy  after  the  initial  acquisition  has 
been  accomplished.) 

The  essential  step  in  image  correlation  guidance  is  to  find  the 
position  of  'best  fit"  between  two  similar  but  nonidentical  Images  or 
"maps":  a sensor  image  of  the  terrain  surrounding  a desired  target, 

obtained  in  real  time  as  the  weapon  approaches,  and  a previously  pre- 
pared reference  image  of  roughly  the  same  area.  The  match  point  is 
found  by  systematically  displacing  one  map  relative  to  the  other  and 
computing,  for  each  of  the  many  possible  displacements,  the  value  of 
a comparison  function  or  "metric"  that,  ideally,  has  an  extremum 
(max  or  min)  value  at  the  match  point.  The  particular  displacement, 
suitably  scaled,  that  produces  the  extremum  becomes  the  correction  sig- 
nal for  the  guidance  system. 

Unfortunately,  precisely  because  the  two  maps  are  not  Identical — 
owing  to  detector  noise,  real  changes  in  the  scene,  geometrical  distor- 
tions,  and  several  other  causes  that  are  discussed  in  Part  I of  this 
report — the  displacement  that  produces  the  extremum  does  not  alleys 
correspond  to  the  correct  match  point.  It  only  does  so  on  the  average. 
Accordingly,  this  analysis  of  Part  I focuses  on  two  topics  of  principal 
concern:  (a)  the  probability  of  achieving  a correct  match  (conversely, 
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the  probability  of  a "false  lock"  or.  In  military  terms,  a gross  error) 
and  (b)  the  selection  of  an  appropriate  comparison  metric  to  maximize 
(a). 

Part  1 of  this  report  describes  the  results  of  a direct  approach 

to  determining  the  probability  of  a correct  match,  P , first  for  random 

c 

scenes  with  Gaussian  statistics,  and  then  for  real  scenes  in  the  presence 

of  noise  and  various  errors.  Two  commonly  used  comparison  metrics  are 

calculated  by  using  the  so-called  Product  (a  sum  of  products  that  is 

related  to  classical  correlation)  and  MAD  (mean  absolute  difference) 

algorithms,  respectively.  It  is  shown  that  in  all  cases  P Increases 

c 

with  the  size  of  the  data  sample  and  with  the  elemental  signal-to-noise 
ratio  (S/N),  and  decreases  (slowly)  with  increasing  search  area.  It  is 
also  shown  that  at  low  S/N,  the  Product  algorithm  is  the  preferred  one 
(i.e.,  it  leads  to  higher  probabilities  of  correct  lock),  but  at  high 
S/N,  the  MAD  algorithm  is  preferred.  However,  when  geometrical  errors, 
such  as  synchronization  (an  effect  peculiar  to  digital  systems  in  which 
the  cells  of  the  two  maps  are  staggered  by  some  unknown  fraction  of  a 
picture  element),  rotation,  and  scale  factor  (magnification),  are  pres- 
ent, the  effective  value  of  S/N  is  significantly  reduced.  Thus  the 
higher  values  that  would  render  the  MAD  or  similar  algorithms  attractive 
are  seldom  realized  in  practice. 

The  fundamental  nature  of  the  map-matching  problem  is  reexamined 
in  this  portion  of  the  study  (Part  II),  and  the  degree  of  theoretical 
justification  for  the  use  of  the  various  comparison  metrics  is  also  in- 
vestigated. Since  the  problem  is  basically  one  in  statistical  decision 
theory,  it  is  shown  that  the  optimum  solution  is  achieved  by  computing 
the  likelihood  ratio  for  each  comparison  and  then  choosing  the  match 
point  at  the  place  where  the  likelihood  ratio  is  maximum.  Unfortunately, 
that  computation  requires  a knowledge  of  the  N-dimenslonal  joint  proba- 
bility distributions — functions  that  are  unknown  and,  in  a practical 
sense,  unmeasurable.  Hence,  one  must  resort  to  approximations.  These 
usually  take  the  form  of  maximizing  or  minimizing  one  of  several  func- 
tions, herein  called  "metrics."  In  much  current  work,  these  are  chosen 
almost  arbitrarily  and  therefore  must  be  subjected  to  essentially  experi- 
mental validation.  By  considering  two-plcture-element  scenes,  such  that 
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the  likelihood  ratio  and  several  of  the  commonly  used  metrics  can  be 
expressed  in  simple  algebraic  form  and  discussed  in  geometrical  terms, 
the  essential  features  of  the  various  metrics  are  explained  and  com- 
pared with  the  likelihood  ratio.  In  this  way  heuristic  arguments  are 
developed  that  support  the  use  of  the  Product  algorithm  when  S/N  is  low 
and  the  MAD  algorithm  when  S/N  is  high.  The  notion,  borrowed  from 
signal  detection  theory,  that  the  classical  correlator  (l.e.,  the 
Product  algorithm)  is  optimum  for  this  application  is  shown  to  be  erron- 
eous; the  two  problems  are  fundamentally  different. 

Two  major  conclusions  were  derived  from  this  study.  First,  on  the 

basis  of  the  empirical  results  of  the  simulation  studies  described  in 

Part  I,  it  appears  that  the  theoretically  predicted  values  of  P for 

c 

random  scenes  with  Gaussian  statistics  can  be  considered  as  an  approxi- 
mate lower  bound  for  the  probability  of  acquisition  in  a real  applica- 
tion. The  quantitative  relationships  between  P^  and  various  system 
parameters  that  have  been  derived  for  random  scenes,  and  that  have  been 
largely  confirmed  by  simulation  testing  with  real  scenes,  can  therefore 
be  used  to  carry  out  various  design  tradeoffs,  including  a balancing  of 
the  costs  of  a tighter  overall  P^  requirement  with  the  loss  of  those 
weapons  that  fail  to  acquire.  The  theoretical  model  of  the  random 
Gaussian  scene  is  known  to  be  not  completely  realistic,  but  it  appears 

to  err  on  the  conservative  side.  Thus,  a "floor"  for  P can  be  estab- 

c 

lished,  which  should  permit  the  flight  test  performance  of  future  prop- 
erly designed  systems  to  be  somewhat  better  (i.e.,  to  exhibit  fewer 
gross  errors)  than  is  predicted  by  the  theory. 

The  second  conclusion  is  that  there  ought  to  be  better  algorithms 
than  those  that  have  usually  been  used  in  the  past.  Since  (a)  there 
is  at  present  little  theoretical  basis  for  the  commonly  used  comparison 
metrics,  and  (b)  most  real  terrain  contains  features  beyond  those  de- 
scribable  by  simple  Gaussian  statistics,  it  seems  both  reasonable  and 
not  inconsistent  with  theory  to  search  for  more  efficient  ways  to  carry 
out  the  initial  map-matching  or  target-acquisition  function.  In  par- 
ticular, drastic  preprocessing  to  extract  special  features  of  a given 
scene,  using  techniques  currently  being  developed  and  exploited  in  the 
field  of  pattern  recognition,  should  lead  to  more  efficient  algorithms. 
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I . INTRODUCTION 


Map  matching  Is  fundamentally  a problem  In  statistical  decision 
theory.  During  the  acquisition  phase  it  is  necessary  to  decide  whether 
or  not  the  sensor  and  reference  maps  are  matched.  But  because  of  noise, 
errors,  distortions,  etc.,  possibly  in  each  map,  there  is  no  foolproof 
way  to  make  certain  that  the  maps  have  been  matched  correctly.  The 
most  that  can  be  done  is  to  determine  the  probability  that  the  maps 
are  in  a certain  geometrical  relationship  to  each  other  based  on  the 
available  data.  The  optimum  map-matching  system,  therefore,  is  by  def- 
inition the  system  whose  output  is  the  set  of  a posteriori  probabilities 
describing  each  possible  relationship.  Once  these  probabilities  are 
determined  and  costs  are  assigned  to  each  kind  of  wrong  decision,  the 
decision  rule  is  implicitly  defined  by  the  requirement  that  some  measure 
of  the  total  cost,  e.g.,  the  average  total  cost  of  a decision,  be  a 
minimum.  The  cost  assignment  and  the  details  of  the  resulting  decision 
rule  do  not  affect  the  underlying  structure  of  the  map-matching  system. 
The  decision  rule,  whatever  its  form,  is  necessarily  based  on  the  values 
of  the  a posteriori  probabilities.  Hence,  the  primary  task  is  to  deter- 
mine these  probabilities. 
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II.  STATISTICAL  DECISION  THEORY 


The  basic  relationship  between  a priori  and  a posteriori  probabili- 
ties is  given  by  Bayes'  formula.  For  simplicity  of  exposition,  suppose 
that  only  two  possibilities  are  of  interest,  namely  that  the  maps  are, 
or  are  not,  matched.  Let  these  two  conditions  be  denoted  by  the  sym- 
bols S (signal)  and  B (background),  respectively.  If  a sensor  map 
consisting  of  N data  elements  is  represented  by  the  N-dlmensional  vec- 
tor X,  then  Bayes'  formula  for  the  matched  hypothesis  states  that 

P(s|x)P(x)  = P(x|s)P(S)  , (1) 

= the  a posteriori  conditional  probability  that  the 
maps  are  matched,  given  that  the  sensor  map  has  the 
value  X, 

= the  a priori  probability  that  the  sensor  map  has 
the  value  x, 

= the  conditional  probability  that  the  sensor  map  has 
the  value  x,  given  that  the  maps  are  matched, 

= the  a priori  probability  that  the  maps  are  matched. 

statement  with  S replaced  by  B holds  for  the  unmatched 
Since  there  are  only  two  possibilities,  the  probability 
that  the  sensor  map  has  the  value  x is  simply 

P(x)  = P(xls)P(S)  + P(xjB)P(B)  . (2) 

The  likelihood  ratio  L(x)  is  defined  by 

L(x)  = P(x|s)/P(x|b)  , (3) 

so  that,  after  substituting  (2)  and  (3)  into  (1),  the  a posteriori 
probability  of  a match  can  be  written  as 


where  P(s|x) 

P(x) 

P(x|s) 

P(S) 

An  analogous 
hypothesis. 
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Since  the  probability  that  the  map  assumes  precisely  the  value  x is 
usually  zero,  it  is  to  be  understood  that  the  indeterminate  form  of 
the  likelihood  ratio  is  to  be  replaced  by  the  ratio  of  the  respective 
probability  densities  p(xls)  and  p(x|B).  The  a priori  probabilities 
can  be  considered  to  have  known  constant  values  that  describe  the 
estimates  of  the  position  of  the  flight  vehicle  and  the  direction  of 
the  sensor's  line  of  sight  when  a sensor  map  is  obtained.  Since  the 
a posteriori  probability  is  a simple  monotonic  function  of  the  like- 
lihood ratio,  it  is  clear  that  a knowledge  of  the  likelihood  ratio's 
value  is  equivalent  to  a knowledge  of  the  a posteriori  probability. 
Hence,  the  optimum  map-matching  system  is  nothing  more  than  the  system 
that  computes  the  value  of  the  likelihood  ratio  for  each  comparison 
between  a sensor  map  and  a reference  map. 

In  a purely  formal  sense,  Eq.  (4)  represents  a complete  solution 
to  the  problem.  The  essential  difficulty  in  applying  Eq.  (4)  to  a 
practical  situation,  however,  is  that  the  N-dimensional  joint  proba- 
bility functions  needed  to  form  the  likelihood  ratio  are  usually  un- 
known. A typical  sensor  map  may  consist  of  tens  to  thousands  of 
spatially  correlated  element  values  with  a non-Gaussian  joint  proba- 
bility distribution.  In  the  absence  of  a theoretical  framework  to 
describe  the  form  of  the  distribution,  the  only  recourse  for  deter- 
mining the  distribution  is  direct  measurement,  which,  for  maps  con- 
taining even  as  few  as  three  or  four  elements,  is  a practical 
impossibility. 

Because  the  probability  distributions  needed  to  form  the  li':e- 
lihood  ratio  are  generally  unknown,  and,  in  a practical  sense,  essen- 
tially unmeasurable,  another  approach  to  map  matching  is  required. 


T "rm 
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a vector  and  F(x)  is  a number.  The  first  member  of  the  pair  is  often 
called  the  object  and  the  second  member,  its  value.  Many  metrics  have 
extremal  properties,  l.e.,  for  some  particular  vector,  the  value  of 
the  metric  is  either  a maximum  or  a minimum.  Familiar  examples  of 
such  metrics  are  given  below. 

Th^^  Normalized  Inner  Product  (NProd)  Metric 

The  normalized  inner  Product,  sometimes  called  the  normalized 
Product  (NProd),  is  defined  by 


F(x) 


(5) 


where  y is  some  given  vector,  x • y denotes  the  inner  Product,  and 
jj  jj  denotes  the  norm  of  a vector  defined  by 


||x|)  = (x  • x)  ^ and  j|y|j  = (y  • y)'^  . 


(6) 


The  extremal  property  of  this  metric  follows  from  the  familiar  Cauchy- 
Schwartz  inequality.  Thus,  |f(x)|  ^ 1 for  all  vectors  x ^ y,  with 
equality  occurring  only  when  x = Cy,  where  C is  an  arbitrary  constant. 


The  Difference  Squared  Metric 
This  metric  is  defined  by 

F(x)  = )|x  - yf  , (7) 

where  y is  some  given  vector.  Thus,  F(x)  S 0 for  y.  ^ y , with  equality 
only  when  x = y. 


The  Mean  Absolute  Difference  (MAD)  Metric 
This  metric  is  defined  by 

^ Ui  - yil  , 


(8) 


where  and  denote  tth  components  of  the  vectors  x and  y,  respec- 
tively, and  the  sum  extends  over  all  components.  As  with  the  differ- 
ence squared,  F(x)  S 0 for  x ^ y,  with  equality  only  when  x = y. 

If  the  vectors  x and  y are  associated  with  the  sensor  and  the 
reference  maps,  respectively,  then  a physically  reasonable  basis  for 
attempting  to  adapt  such  metrics  to  the  problem  of  map  matching  im- 
mediately suggests  itself.  If  it  is  assumed  that  when  the  sensor's 
line  of  sight  (i.e.,  the  map  center)  coincides  with  the  target,  the 
sensor  map  "most  nearly"  resembles  the  reference  map,  then  it  is 
reasonable  to  assume  that  the  value  of  the  metric  in  that  case  will 
be  "closest"  to  the  extremal  value.  Because  of  noise  and  other  dis- 
tortions, however,  a perfect  match  is  never  to  be  expected.  Thus, 
some  acceptance  threshold  different  from  the  extremal  value  is  Implied. 
If  it  is  known  beforehand  that  the  sensor  map  is  contained  within  the 
ensemble  of  maps  available  for  comparison,  i.e.,  the  reference  map, 
the  decision  rule  may  simply  decide  that  the  extremal-producing  map 
defines  the  match  point.  If  the  sensor  map  is  not  known  to  be  con- 
tained within  the  reference  map,  the  decision  rule  may  require  the 
value  of  the  extremal  to  be  above  some  prescribed  threshold  value. 

The  obvious  question  that  arises  when  such  metrics  are  used  is, 
"What  metric  should  one  use?  or,  stated  differently.  How  is  an  optimum 
or  at  least  a "good"  metric  selected?  The  answer  clearly  depends  on 
the  map  statistics — which  have  been  completely  ignored  in  the  above 
definitions  of  metrics.  Any  attempt  at  an  analytical  formulation  of 
this  question  leads  directly  back  to  the  original  statement  of  the 
map-matching  problem  as  a problem  in  statistical  decision  theory.  The 
optimum  metric  is  the  likelihood  ratio  defined  by  Eq . (3),  since  it 
essentially  defines  the  most  that  can  be  known,  namely  the  a posteriori 
probabilities  that  the  maps  are  matched  given  the  available  data.  Thus, 
if  a completely  ad  hoc  approach  to  map  matching  is  to  be  avoided,  some 
sort  of  theoretical  analysis  based  on  the  likelihood  ratio  must  be 
undertaken . 

HEURISTIC  INTERPRETATION  OF  THE  LIKELIHOOD  RATIO 

Some  insight  as  to  what  constitutes  a good  metric  can  be  obtained 
if  one  considers  the  likelihood  ratio  for  the  extremely  simple  case  of 
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maps  that  consist  of  only  two  elements.  This  simplification  permits 
the  use  of  elementary  geometrical  concepts  to  illustrate  the  essential 
features  of  various  metrics.  The  extension  of  the  concepts  to  larger 
maps  is  straightforward. 

A single  two-element  map  is  thus  represented  by  a pair  of  values 
^^1*^2^  ■ ensemble  of  possible  sensor  maps  for  the  unmatched  con- 

dition is  then  described  by  its  joint  probability  density  function 
p(Xj^,X2|b).  For  simplicity,  let  the  mean  value  of  this  distribution 
be  assumed  zero.  The  two  values  comprising  a sensor  map  will  generally 
be  correlated,  so  that  the  intersection  of  a horizontal  plane  with  the 
density  function  will  produce  a somewhat  oval-shaped  curve,  as  shown 
by  the  dashed  curve  of  Fig.  1.  (The  exact  shape  of  this  curve  of 


Fig.  1 — Schematic  representation  of  the  map-matching  problem 


intersection  depends,  of  course,  on  the  particular  form  of  the  joint 
probability  density  function.)  Thus,  the  maps  within  this  boundary 
contour  occur  with  a certain  frequency  determined  by  the  height  of 
the  intersecting  plane.  If  the  map  values  are  highly  correlated,  the 
oval  tends  to  be  narrow;  if  the  correlation  is  weak,  the  oval  tends 
toward  a circle. 

Suppose  for  the  moment  that  no  errors  of  any  kind  are  present  in 
either  map.  If  the  values  associated  with  the  sensor  map  are  (Sj^,S2) 
for  the  matched  condition,  the  joint  probability  density  function 
of  the  sensor's  output  is  a two-dimensional  Dirac  delta  function, 
namely. 


p(x|s)  = p(x^,X2|s)  = 6(Xj^  - Sj^)6(x2  - S2)  • 


(9) 


The  likelihood  ratio  in  this  simple  case  is  then 


L(x^,X2)  = 


6(xj^  - s^)6(x2  - S2> 
p(Xj\x^1b) 


(10) 


Thus,  if  the  joint  probability  density  function  p(x^,X2|B)  is  well  be- 
haved, the  likelihood  ratio  itself  has  the  form  of  a delta  function.  The 
decision  rule,  which  identifies  the  match  point  with  the  maximum  value 
of  the  likelihood  ratio,  then  tells  us  to  decide  that  a match  is  pres- 
ent whenever  precisely  the  pair  (Sj^,S2)  occurs.  The  rule,  of  course, 
simply  expresses  the  obvious.  We  know  exactly  what  to  expect  when  a 
match  is  present.  Although  the  occurrence  of  the  pair  may  not  be  an 
unambiguous  indication  of  a particular  match  point,  we  have  no  means 
for  refining  our  decision.  Hence,  the  best  we  can  do  is  to  assume 
that  a match  is  present.  The  situation  is  completely  analogous  to 
finding  a specific  two-digit  number  in  a table  of  numbers,  for  example. 

If  we  know  that  the  table  is  accurate,  we  decide  that  we  have  found 
the  number  when  precisely  the  reference  two-digit  pair  occurs.  In  the 
absence  of  other  information,  every  such  pair  must  be  considered  a 
match  point. 
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Now  suppose  that  both  the  sensor  and  reference  maps  are  subject 
to  error  because  of  noise,  distortions,  etc.  For  simplicity,  suppose 
further  that  all  of  these  errors  have  zero  mean  value,  are  statis- 
tically independent  of  the  uncorrupted  map  values,  and  can  be  considered 
to  be  additive  in  their  effect.  If  all  of  these  effects  are  simply 
called  "noise,"  with  a joint  probability  density  function  p(Xj^,X2)n), 
then,  if  the  noise  is  uncorrelated  and  has  a zero  mean  value,  the 
Intersection  of  the  noise  distribution  with  a horizontal  plane  will 
be  a circle,  as  shown  at  the  origin  in  Fig.  1.  With  these  assumptions, 
the  joint  probability  density  function  p(Xj^,X2|BN)  of  the  corrupted 
sensor  maps  for  the  unmatched  condition  is  p = p * p , where  the 
asterisk  denotes  the  convolution  operation  and  where  the  arguments  of 
the  functions  have  been  dropped  and  the  conditions  incorporated  into 
subscripts  to  simplify  writing.  Similarly,  the  joint  probability 
density  function  p(x^,X2|SN)  of  the  sensor's  output  when  the  maps  are 
matched  is  p^^^  ^ Pn  * Ps  ^ Pn^^I  ” “l’’'2  ~ Therefore,  hori- 

zontal cuts  through  these  two  probability  density  functions  result  in 
contour  curves  somewhat  like  the  solid  curves  of  Fig.  1 that  are 
centered  on  the  origin  and  the  point  (Sj^,S2). 

If  the  noise  is  small  compared  with  the  variance  of  the  uncor- 
rupted sensor  map  values,  the  maximum  value  of  the  likelihood  ratio 
will  occur  in  the  neighborhood  of  the  reference  map  values  (Sj^,S2). 

A horizontal  cut  through  the  likelihood  ratio  near  its  maximum  value 
will  then  result  in  a closed  curve  somewhat  like  that  shown  bounding 
the  shaded  area  in  Fig.  1.  The  occurrence  of  a pair  of  map  values 
close  to  the  reference  values  is  then  an  indication  of  a probable 
match.  Thus,  for  the  assumptions  stated  above,  a metric  that  defines 
a family  of  closed  contours  about  the  reference  map  values  qualita- 
tively resembles  the  likelihood  ratio.  It  is  to  be  emphasized,  how- 
ever, that  the  assumption  that  all  the  errors  affecting  the  maps  can 
be  lumped  together  into  a single,  statistically  independent,  proba- 
bility density  function  with  random  noise-llke  properties  is  a simpli- 
fication made  here  for  illustrative  purposes  only. 
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LIKELIHOOD  RATIO  FOR  GAUSSIAN  DISTRIBUTIONS 

If  it  is  assumed  that  all  the  distributions  have  a Gaussian  form, 
then  an  explicit  representation  of  the  likelihood  ratio  is  obtained 
by  simple  substitution.  Let  it  therefore  be  assumed  that  the  ensemble 
of  possible  sensor  map  values  for  the  unmatched  condition  has  the 
joint  probability  density 


p(Xj^,X2|B)  = 


[-(Xj  - x^)/20^1  - r^)] 


2Tra^(l  - r^)^ 


where  is  the  root-mean-square  variation  of  the  uncorrupted  sensor 

map  values  and  r is  the  correlation  between  the  values  x and  x„. 

B 12 

Similarly,  let  it  be  assumed  that 


p(x^,x^ |n) 


exp  [-(x^  + X2)/2o^J 


where  is  the  root-mean-square  variation  of  the  noise.  Then  by 
forming  p^^^  and  p^^^  and  substituting  into  Eq.  (3),  one  can  show  that 
the  logarithm  of  the  likelihood  ratio  is  proportional  to  the  quadratic 


F(Xj^,X2>  = ^2^ 


[ ^’'l  “ ®1^  ^*2  “ ®2^  ] ’ 


J-  2.2 

®BN  “ °B 


’’bn  ' 1 ’’b  • 

®BN 
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Since  in  practice  the  correlation  is  greater  than  zero  and  less 
than  one,  one  can  show  by  elementary  analytical  geometry  that  this 
equation  defines  an  ellipse  with  axes  rotated  45  deg  with  respect  to 
the  axes.  The  center  of  the  ellipse  is  displaced  from  the  un- 

corrupted map  values  (Sj^,S2),  but  the  displacement  tends  to  zero  as 
the  ratio  of  tends  to  zero,  i.e.,  at  high  values  of  the  slgnal- 

to-noise  ratio.  When  the  correlation  is  zero,  the  ellipse  reduces  to 
a circle.  The  likelihood  ratio  thus  assumes  its  largest  values  when 
the  ratio  of  the  map  values  is  approximately  in  the  ratio  of  the  ref- 
erence map  values,  and  when  the  values  of  the  sensor  map  are  separately 
approximately  equal  to  their  corresponding  reference  map  values. 

When  the  background  variation  is  small  in  comparison  witii  the 
noise,  the  sensor's  output  consists  essentially  of  either  the  desired 
signal  plus  noise,  or  noise  alone.  This  situation  is  depicted  in  Fig. 

2.  The  problem  of  discriminating  between  these  two  cases  is  the  classi- 
cal problem  of  signal  detection.  Its  solution  is  the  well-knovni  matched 
filter  that,  for  the  case  of  Gaussian  noise,  is  the  classical  correla- 
tor, i.e.,  the  unnormalized  product  metric.  This  result  follows  im- 
mediately from  the  likelihood  ratio  (see  Eq . (13))  when  0 and  r are 
set  equal  to  zero.  In  that  case,  the  ellipse  defined  by  the  likelihood 
ratio  degenerates  into  the  straight  line  defined  by 

F(Xj^,X2)  = s^Xj^  + S2X2  . (16) 

The  problem  of  map  matching  has  sometimes  been  confused  with  the 
problem  of  signal  detection.  As  a result,  the  solution  to  the  signal- 
detection  problem,  i.e.,  the  classical  correlator,  has  sometimes  been 
erroneously  Interpreted  as  the  theoretically  optimum  solution  to  the 
map-matching  problem.  The  starting  point  in  many  analyses  of  map 
matching  has  therefore  been  the  calculation  of  the  classical  corre- 
lation function.  The  fundamental  difference  between  the  two  problems, 
however,  is  readily  apparent  when  Figs.  1 and  2 are  compared.  In  the 
case  of  map  matching,  one  needs  to  distinguish  between  signal  plus 
noise  and  background  plus  noise.  In  the  case  of  signal  detection,  it 
is  necessary  to  distinguish  between  signal  plus  noise  and  noise  alone. 


Product  metric 


Fig.  2 — Schematic  representation  of  the  signal-detection  problem 


The  solution  of  either  problem  lies  in  the  calculation  of  the  likeli- 
hood ratio,  which,  for  additive  Gaussian  noise,  takes  the  form  of 
Eq.  (13)  and  the  shaded  ellipse  of  Fig.  1 for  map  matching  and  of 
Eq.  (16)  and  the  straight  line  of  Fig.  2 for  signal  detection. 

The  more  general  case  in  which  the  noise  associated  with  the 
matched  condition  is  different  from  that  for  the  unmatched  condition 
is  discussed  in  the  appendix.  Noise  in  both  cases  is  often  partly  due 
to  geometrical  misalignments  in  angle  or  scale.  In  such  cases,  the 
statistics  of  the  sensor's  map  for  the  matched  condition  will  strongly 
depend  on  the  properties  of  the  map  in  the  vicinity  of  the  nominal 
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match  point.  It  is  shown  in  the  appendix  that  when  the  conic  is  non- 
degenerate, the  locus  of  the  likelihood  ratio  can  be  either  an  ellipse 
hyperbola,  or  parabola,  depending  on  the  relative  variances  of  the 
sum  and  difference  of  the  pixel  values  for  the  matched  and  unmatched 
conditions.  In  most  cases  of  practical  Interest,  however,  neither 
variance  for  the  matched  condition  is  expected  to  exceed  its  corre- 
sponding variance  for  the  unmatched  condition.  Thus,  as  the  analysis 
presented  in  the  appendix  shows,  the  locus  of  the  likelihood  ratio  is 
always  an  ellipse,  which  agrees  with  the  simplified  discussion  above. 

In  summary,  therefore,  we  have  the  following  two  extremes:  wlien 

the  signal-to-nolse  ratio  is  very  large,  the  likelihood  ratio  reduces 
to  a delta  function,  so  that  the  optimum  metric  is  one  that  requires 
each  component  of  the  sensor  map  vector  to  fall  within  an  arbitrarily 
small  distance  from  its  corresponding  reference  map  value.  This  is 
essentially  the  MAD  metric.  When  the  slgnal-to-nolse  ratio  is  very 
small,  the  likelihood  ratio  approaches  the  unnormalized  Product  metric 
These  conclusions,  although  based  on  an  analysis  of  a two-picture- 
element  map,  are  readily  extended  to  maps  of  arbitrary  size. 
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III.  A COMPARISON  OF  THE  LIKELIHOOD  RATIO  FOR 
GAUSSIAN  DISTRIBUTIONS  WITH  OTHER  METRICS 


It  Is  interesting  to  compare  the  metric  defined  by  the  likelihood 
ratio  for  the  Gaussian  case  discussed  in  Section  II  with  some  of  the 
metrics  mentioned  earlier  in  that  section. 

THE_  NORMALIZED  INNER  PRODUCT  (NProd)  METRIC 

In  two  dimensions,  the  normalized  inner  Product  metric  has  the 

form 


F(x^,X2> 


Xj^s^  + X2S2 

+ X2>^(s^  + s 


(17) 


where  the  given  vector,  of  course.  Is  the  reference  map  vector  (Sj^,S2). 
The  values  of  the  metric  that  exceed  some  prescribed  threshold  F are 

_1  o 

those  within  the  angular  sector  ± cos  F centered  about  the  line 

o 

through  the  origin  and  the  point  (s^,S2),  as  shown  in  Fig.  3.  The 
normalized  inner  product  assumes  its  largest  values  when  the  ratio  of 
the  sensor  map  values  X2/Xj^  is  equal  to  the  ratio  of  the  reference  map 
values  82/3^.  Thus,  this  metric  is  appropriate  when  no  a priori  knov;- 
ledge  of  absolute  map  values  is  assumed.  The  only  assumption  is  that 
the  two  data  sets  are  in  a linear  relationship. 

If  the  statistics  of  a sensor  map  are  assumed  to  be  stationary 
and  ergodic,  the  average  norm  for  a sensor  map  containing  N elements 
approaches  the  ensemble  average.  Thus, 

lim  ||x|(/n'*  = Oj.  , (18) 

N-KX. 


where  is  the  root -mean-square  variation  of  the  sensor  map  values 

D 

averaged  over  an  infinite  ensemble.  If  one  assumes  that  the  sensor 


Likelihood  ratio 


Fig.  3 — Comparison  between  metrics 


map  contains  so  many  elements  that  Eq.  (10)  can  be  considered  an 
equality  for  a finite  value  of  N,  then  the  average  norm  associated 
with  both  the  sensor  and  reference  maps  can  be  considered  constant 
for  all  maps.  Except  for  a constant  factor,  Eq.  (17)  reduces  to  the 
classical  correlator  or  tlie  unnormalized  Product  (Prod)  metric  defined 
by 


F(x)  = i (x  . y)  ^ ^9) 

where  y is  the  reference  map  vector.  For  this  case,  computations  using 
either  metric  will  give  approximately  equal  performance. 


I 
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The  distinction  between  the  use  of  the  Product  and  normalized 

Product  metrics  is  well  illustrated  in  the  calculations  of  Sections  II, 

* 

III,  and  IV  of  Part  I of  this  report.  In  Sections  II  and  III,  the 
probability  of  acquisition  is  computed  over  an  infinite  ensemble  so 
that  the  unnormalized  Product  metric  can  be  used.  In  Section  IV,  cal- 
culations are  made  for  a finite  sample  of  maps.  Hence,  the  different 
metrics  produce  different  results. 

THE  DIFFERENCE  SQUARED  METRIC 

The  locus  of  the  difference  squared  metric  defined  by  Eq.  (7)  is 
a circle  centered  at  (s^,S2),  as  shown  in  Fig.  4.  Thus,  the  acceptance 
region  is  similar  to  that  defined  by  the  likelihood  ratio  for  the  spe- 
cial case  of  uncorrelated  map  values,  but  the  location  of  the  center 
of  the  circle  is  not  at  the  optimum  point.  The  displacement  between 
centers  is  minor,  however,  when  the  slgnal-to-noise  ratio  is  large. 

The  metric  takes  no  account  of  the  possible  correlation  between  map 
values  as  does  the  likelihood  ratio. 

THE  MEAN  ABSOLUTE  DIFFERENCE  (MAD)  METRIC 

The  locus  of  the  mean  absolute  difference  (MAD)  is  a family  of 
squares  rotated  45  deg  with  respect  to  the  axes,  also  shown  in 

Fig.  4.  The  properties  of  this  metric  are  thus  similar  to  the  differ- 
ence squared  metric.  The  acceptance  region  is  a small  symmetrical  area 
surrounding  the  reference  map  values.  This  metric  does  not  make  use 
of  the  correlation  between  sensor  map  values. 

PROBABILITY  OF  CORRECT  ACQUISITION 

When  the  desired  map  is  assumed  to  be  present  in  the  ensemble  of 
maps,  and  when  the  extremal  value  of  the  ensemble  is  assumed  to  define 
the  matched  condition,  the  probability  of  correct  acquisition  F^(s) 

■k 

H.  H.  Bailey,  F.  W.  Blackwell,  C.  L.  Lowery,  and  J.  A.  Ratkovic, 
Image  Correlation,  Part  1:  Simulation  a>id  Analycis , The  Rand  Corpora- 

tion, R-2057/1-PR,  November  1976. 
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Fig.  4 — Further  comparison  between  metrics 


for  a maximizing  metric  can  be  defined  as 


P (s) 

c 


00 

■ -] 

1 P(F|S) 

/ p(F' 1b)  dF' 

—00 

J 

— OO 

dF  , 


(20) 


where  p(f|s)  is  the  probability  density  of  the  metric  given  that  a match 
is  present,  p(f|b)  is  the  probability  density  of  the  metric  given  that  a 
match  is  not  present,  and  Q is  the  total  number  of  out-of-register  com- 
parisons. Equation  (20)  is  equivalent  to  Eq . (3)  of  Part  I.  A simple 
change  in  the  limits  of  integration  defines  the  probability  of  correct 
acquisition  for  a minimizing  metric.  The  calculation  of  this  proba- 
bility requires  a knowledge  of  the  probability  density  functions  of 
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the  metric,  which,  in  turn,  depend  on  the  multidimensional  joint  prob- 
ability density  function  of  the  sensor  map  values.  This  joint  proba- 
bility density  function  is  generally  unknown,  so  that  some  sort  of  an 
assumption  regarding  the  form  of  the  distribution  of  the  metric's 
values  must  be  made  in  order  to  use  Eq.  (20)  to  estimate  the  probabil- 
ity of  correct  acquisition.  The  calculations  described  in  Sections  II 
and  III  of  Part  I of  this  report  are  based  on  the  assumption  that  these 
distributions  are  Gaussian. 
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IV.  SUMMARY  AND  CONCLUSIONS 


The  theoretical  solution  to  the  problem  of  map  matching  Is  to  com- 
pute the  likelihood  ratio.  Unfortunately,  the  probability  density  func 
tlons  needed  to  form  the  likelihood  ratio  are  generally  unknown,  and. 

In  a practical  sense,  essentially  unmeasurable.  Hence,  some  sort  of 
theoretical  model  of  the  statistics  of  maps  must  be  developed  or  simply 
assumed  If  an  approach  based  on  the  likelihood  ratio  Is  to  be  pursued. 

The  common  approach  to  map  matching  based  on  the  extremals  of  a 
metric  has  little  theoretical  foundation  beyond  the  simple  notion  of 
continuity.  The  extremal  property  of  a metric  generally  represents  a 
universal  statement  about  the  class  of  all  functions.  Moreover,  the 
extremal  value  Is  associated  with  a particular  function  or  a narrowly 
defined  class  of  functions  whose  values  are  precisely  specified  at 
every  point.  This  corresponds  to  the  Ideal  error-free  (noiseless) 
sensor  map,  which,  of  course,  occurs  with  probability  zero.  By  con- 
tinuity, however.  It  Is  reasonable  to  assume  that  when  the  sensor  map 
Is  close  to  the  Ideal,  the  value  of  the  metric  Is  close  to  the  extremal 

When  the  various  map  errors  can  be  treated  as  additive  noise,  a 
heuristic  Interpretation  of  the  likelihood  ratio  suggests  the  proper- 
ties of  a good  metric.  Wlien  the  noise  Is  .small  compared  with  the  map 
variance,  and  when  the  absolute  amplitude  of  the  sensor  map  Is  cali- 
brated to  that  of  the  reference  map,  a metric  that  requires  an  approxi- 
mate equality  between  corresponding  sensor  map  and  reference  map  values 
should  be  used.  The  MAD  algorithm  Is  such  a metric.  But  when  the 
noise  Is  large  compared  with  the  map  variance,  and  when  no  assumption 
concerning  absolute  signal  levels  is  made,  a metric  requiring  an  ap- 
proximate proportionality  between  corresponding  sensor  map  and  ref- 
erence map  values  should  be  used.  The  normalized  Product  algorithm  is 
such  a metric. 

The  above  conclusions,  although  based  on  an  analysis  of  a si-.tple 
two-picture-element  map,  are  readily  extended  to  maps  of  arbitiarv  size 

The  sometimes  encountered  statement  that  the  classltal  correlati-r , 
i.e.,  the  Product  algoritiim,  is  a theoretically  optimum  approach  t nap 
matching  has  limited  validity.  The  statement  often  derivi's  from  a con- 
fusion between  the  problem  of  map  matching  and  the  problem  of  sign.il 
detection. 


'•V  - 
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Append  lx 

LIKELIHOOD  RATIO  FOR  CORRELATED  BIVARIATE  GAUSSIAN  DISTRIBUTIONS 


Suppose  that  the  joint  probability  densities  for  the  matched  and 
unmatched  conditions  are  denoted  by  p(x,y|SN)  and  p(x,y|BN),  respec- 
tively, where 


p(x,y|SN)  = 


(x  - Sj^)^  - 2r^^(x  - s^Hy  - s^)  + (y  - 
^^SN^^  ~ “^SN^ . 


(A.l) 


p(x,y|BN)  = 


where,  in  general,  ^ and  r^^^  ^ 

The  likelihood  ratio  L(x,y)  is  defined  by 


L(x,y)  = 


p(x,y  SN) 
p(x,y|BN) 


(A.  3) 


Its  logarithm,  in  terms  of  the  two  probability  densities  defined  by 
Eqs.  (A.l)  and  (A. 2),  is 


0^  (1  - r^  x^  - 2r  xy  + y^ 

0 - bn'-  BN''  BN^  y 

X-nL  - 0 2 

°SN^^  ” ^SN^  ^*^BN^^  “ *"bN^ 


(A. 4) 


(x  - - 2r^jj(x  - S^)(y  - s^)  + (y  - 


. ~S.  « • ^ 


-20- 


■ I 


L 


If  Eq.  (A, 4)  Is  rewritten  as  the  quadratic  form 


2 2 

Ax  + Bxy  + Cy  + Dx  + Ey  + F = 0 , 


(A. 5) 


the  coefficients  of  the  quadratic  terms  are  easily  found  to  be 


A = 


= C , 


(A. 6) 


B = 


B 


- "bn> 


(A. 7) 


From  elementary  analytical  geometry,  the  acute  positive  rotation 
angle,  6,  that  transforms  the  conic  to  Its  principal  axes  is  given  by 


tan  20  = 


B 


A - C 


(A. 8) 


If  B as  defined  by  Eq.  (A. 7)  is  assumed  to  be  different  from  zero,  it 
follows  immediately  that  the  principal  axes  of  the  conic  are  rotated 
45  deg  with  respect  to  the  (x,y)  coordinate  axes. 

If  the  conic  is  assumed  to  be  nondegenerate,  l.e.,  if 


A = 


2A  B 

B 2C 

E D 


D 

E 

2F 


* 0 , 


(A. 9) 


then,  from  elementary  analytical  geometry,  the  type  of  conic  Is  deter- 
mined by  the  following  rules: 


1.  If  B - 4aC  = 0,  the  conic  is  a parabola. 
,2 


2.  If  B - 4AC  < 0,  and  A and  A + C disagree  in  sign,  the 
conic  is  an  ellipse. 
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2 

3.  If  B - 4AC  < 0,  and  A and  A + C agree  In  sign,  the  conic 

is  an  imaginary  ellipse,  l.e.,  no  real  locus. 

2 

4.  If  B - 4AC  > 0,  the  conic  is  a hyperbola. 

From  physical  considerations  we  know  that  a real  locus  always 

2 

exists,  so  that  the  condition  B - 4AC  < 0 alone  is  sufficient  to  con- 
clude that  the  conic  is  an  ellipse.  If  W is  defined  as 

W = B^  - 4AC  , (A. 10) 

then,  since  A = C from  Eq.  (A. 6),  we  have 


W = B - 4AC  = (B  - 2A)(B  + 2A)  . 


(A. 11) 


By  combining  Eqs.  (A. 6)  and  (A. 7),  it  is  easy  to  show  that 


B - 2A  = - 


(A. 12) 


and 


B + 2A 


■^B> 


(A. 13) 


From  elementary  probability  theory  a quantity  of  the  form  a (1  + r) 

is  recognized  as  one-half  the  variance  of  the  sum  of  two  correlated  ran- 

2 

dom  variables.  Similarly  a quantity  of  the  form  a (1  - r)  is  recognized 
as  one-half  the  variance  of  the  difference  of  two  correlated  random 
variables.  Using  these  facts,  the  relationship  between  the  type  of 
conic  and  the  properties  of  the  two  probability  densities  is  immediately 
evident  from  Eqs.  (A. 10),  (A. 11),  (A. 12),  and  (A. 13).  When  the  variances 
of  the  sum  and  difference  of  the  pixel  values  for  the  matched  condition 
are  both  either  less  than,  or  greater  than,  the  corresponding  variances 
for  the  unmatched  condition,  the  conic  is  an  ellipse.  When  the  respec- 
tive variances  for  the  matched  and  unmatched  conditions  are  in  an 
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opposite  algebraic  relationship — i.e.,  in  the  matched  condition  one 
variance  exceeds  while  the  other  is  less  than  its  corresponding  var- 
iance for  the  unmatched  condition — the  conic  is  a hyperbola. 

Of  the  various  possibilities,  the  case  in  which  the  variances  in 
the  matched  condition  are  each  less  than  their  corresponding  variances 
in  the  unmatched  condition  is  the  principal  case  of  practical  interest. 
Although  the  variability  of  the  sensor  map  for  the  matched  condition 
depends  on  the  statistics  of  the  map  in  the  vicinity  of  the  match 
point,  its  variation  on  the  average  should  not  exceed  that  associated 
with  the  unmatched  condition.  Hence,  the  acceptance  region  defined  by 
the  likelihood  ratio  will  be  elliptical  in  most  cases  of  practical 
interest. 

The  center  of  the  conic  can  be  found  either  by  completing  the 
squares  in  Eq.  (A. 4)  or  by  finding  the  coordinates  of  the  extremum 
of  the  likelihood  ratio  by  using  elementary  differential  calculus.  If 
the  conic  is  transformed  to  its  principal  axes  by  the  transformation 

u = (x  - y)/2  ^ (A. 14) 


and 

V = (x  + y)2'^  , 


(A.  15) 


it  can  be  shown  that  the  coordinates  of  the  center  (u  ,v  ) are  given 

o o 


by 


u = 
o 


-(Sj^  + s^) 

+ ’■cm) 


(A. 16) 


and 


-(S2  - Sj^) 


(A. 17) 
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where 


P = B - 2A 


(A. 18) 


and 


Q = B + 2A  . (A. 19) 

Thus,  the  conics  for  different  values  of  the  likelihood  ratio  are  all 
concentric. 

Finally,  it  can  be  shown  by  completing  the  squares  in  Eq.  (A. 4) 
that  the  major  and  minor  axes  of  the  conic  are  always  in  a constant 
ratio,  namely 


u : V 


Ipf'^  : iQl*^  . 


(A. 20) 
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The  fundamental  nature  of  the  map-matching  problem 
is  examined  and  theoretical  justification  for  using 
various  comparison  metrics  is  investigated.  Since 
the  problem  is  one  of  statistical  decision  theory, 
the  optimum  solution  is  to  compute  the  likelihood 
ratio  for  each  comparison  and  choose  the  match  point 
at  a place  where  the  likelihood  ratio  is  maximum. 
That  requires  a knowledge  of  N-dimensional  joint 
probability  distributions,  hence,  we  resort  to 
approximations  that  maximize  or  minimize  several 
functions  called'^'*metrics*^  ^ By  considering  two- 
picture-element  scenes,  the  features  of  various 
metrics  are  explained  and  compared  with  the  like- 
lihood ratio.  In  this  way  heuristic  arguments  are 
developed  that  support  the  use  of  the  Product 
aTgorithm-(^— SfeWi-ef  products  that  is  related  to 
classical  correlation)  when  S/N  is  low,  and  the 
MAD  algorithm  (mean  absolute  difference)  when  S/N 
is  high.^(AR) 


